SEARCH REQUEST FORM 

Scientiflc and Technical Information Center 



Reoucsicr s Fii,ll Name Ns ^<ggy^4fe^ (g. hxaminci - :^2z2l^ Dale: [(-I'^Oj ^ 

^nUit / C5H Phone Number 30_Jji25-2H.. Ser.<<l Number Ic^ In'^l TOSl^ 

Mai] Box and Rids Room Location Rcsiilis Formal Preferred .circle ) PAPEHTdIS^QE-MAIL 

If more than one search is submitted, please prioritize searches in order of need. 

• »»*»**#***««***«*«ir*T«****»******************«**«*»*«w*»«**** * ************ 

Picoic r>rov lac j iktoilcd s^a;*. n'.cni of" the search lopic and descr ibe o^ ^Pk'c:nc.ill-> pr>s^ihlc the Mibicci mancr to be searched 
Include !hc cicc'cd specie or ^micturcs. keywords, synonyms, acrcknyms. ^nd rciiisnrx' numbers, and combine with the concept or 
uMlits of the invention Dcfme any terms that may have a special meaning Qw t examples or relevant citations, authors, etc. if 
known Please attach a copy oi' the cover sheet, pcnincnt claims. «nd abstract 

Title of Invennon Cxi^y^ he^\J^r<^ (o ^f^^ cAr^^^ Or Af rvt>r«y\Q_^^gr''^^^^"T ?<r>i^H 
Inventors tplcjiit- proMoc fill! naniesl 



Earhesi Prior)!\ Filing Dale 7^ 2. X 
\ce Sfarclici O'lt) ' PIcaie tnduJe ail pertinent 

|/v 4~<- O.J. ^.jfe^^^^lk^i^e^ Je^""-«- c)=^^«^'^J<- (^^'-^^'^^ 



"fnr <lKquenee Sfarclia Qui) ' Please include ait pertinent tu formation (parem. chilJ. divisional, or mued patent numbers) atpttg with the 
npprfipnotr ^c'taf <Mfnh.ct 




ST \ FF LSF. OM-V Type of Search Vctidors and cost %\ncre appHc-^iblc 

\ti'.:'\z- SA Sequerice ic)_ ... _ 



Stj,;-.. p-yr.e - ^ AA Sequcr.ce !!?=»?pc: 

Sti'.; -f Icc j; f ' , $fT\jCiu'< f«5 0-:C"C <'>''>i- ._ . _ , 



Bit: < 



GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: November 12, 2003, 14:22:54 ; Search time 34 Seconds 

(without alignments) 
53,128 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-10-031-505-6 
36 

1 PKKKRKV 7 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0,5 



Searched: 



830525 seqs, 258052604 residues 



Total number of hits satisfying chosen parameters: 



830525 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : SPTREMBL 23:* 



1 




sp archea:* 


2 




sp bacteria : * 


3 




sp fungi : * 


4. 




sp_human: * 


5 




sp_invertebrate : * 


6 




sp mammal:* 


7 




sp mho:* 


8 




sp organelle:* 


9 




sp^phage: * 


10: 


sp plant:* 


11: 


sp rodent : * 


12: 


sp virus:* 


13: 


sp vertebrate:* 


14: 


sp unclassified:* 


15: 


sp rvirus:* 


16: 


sp bacteriap:* 


17: 


sp archeap:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 


36 


100 . 


0 


85 


11 


Q64 14 1 


yb4i4i racnus sp. 


2 


36 


100 . 


0 


134 


11 


Qb4 142 


yt34l4Z j:ai.t.us sp . 


3 


36 


100 . 


0 


1 /D 




Qyw /w / 


yyw/w/ siiuian viru 


4 


36 


100 . 


0 


259 


5 


Q9NBV7 


Q9nbv7 schistosoma 


5 


36 


100 . 


0 


537 


12 


Q9dZP / 


Q98zp7 simian viru 


6 


36 


100 . 


0 


705 


12 


Q9 J /C9 


Q9j7c9 simian viru 


7 


36 


100 . 


0 


7 0/ 


12 


0!)Dozb 


uddjzd Simian vxru 


8 


36 


100 . 


0 


T r\ o 
7 U 0 


12 


Qy DH / U 


Q9dh70 simian viru 


9 


36 


100 . 


0 


709 


12 


Q9WoGo 


Q9w8g8 simian viru 


10 


36 


100 , 


0 


711 


12 


Q9WND0 


Q9wnd0 simian viru 


11 


36 


100 . 


0 


/II 


1 o 

i Z 


Qo JMl y 


Q8jmi9 simian viru 


12 


36 


100 . 


0 


111 
/ 1 1 




yyVv oVl 


Q9w8vl simian viru 


13 


3d 


100. 


u 


1 1 DC) 


1 n 
1 U 


('^ Q ^ A T»7 

V2 y AW 2 


Q9caw2 arabidopsis 


14 


35 


97 . 


2 


O T O 

2 /o 


12 


QyyMM / 


Q9ymm7 lymantria d 


15 


•a C 

35 


97 . 


2 






Qy bvvu 


Q96vv0 fusarium ox 


16 


35 


97 . 


2 


5 4 <^ 


3 




/~\ftAQQO ■^^t^->t/-A^^wt 

yuuoo2 lusarium ox 


17 


35 


97 , 


2 


loZlo 


c 

J 


Q9Nc S3 


Q9nfs3 drosophila 


18 


35 


97 . 


2 


180/4 


r 

o 


Q9I /U4 


yyi /u4 arosopniia 


19 


33 


91 - 


, 7 


145 


16 


Q9IiAK6 


Q91ak6 corynebacte 


20 


33 


91 . 


7 


177 


5 


Q8ToR9 


Q8t8r9 drosophila 


21 


33 


91 . 


. 7 


212 


17 


Q8TZY3 


Q8tzy3 pyrococcus 


22 


33 


91 . 


7 


213 


1 / 


Oo94 bl 


059461 pyrococcus 


23 


33 


91 . 


, 7 


229 


5 


Q8IMW8 


Q8imw8 drosophila 


24 


33 


91 . 


, 7 


286 


10 


Q9SIU9 


Q9siu9 arabidopsis 


25 


33 


91 , 


7 


r>t Ci r\ 

290 


1 1 


Q9WTX1 


Q9wtxl mus musculu 


26 


33 


91 . 


, 7 


3d1 


J 


O13oob 


uiooob scnizosaccn 


27 


3i 


y 1 . 


/ 


/I O 1 


c 
0 


yy VCO D 


Q9vc85 drosophila 


28 


33 


91 , 


, / 


4 / U 




O X T T CO 

y oNl b 0 


Q8nis8 neurospora 


29 


33 


91 , 


, 7 


504 


10 


Q9MUV4 


Q9m0v4 arabidopsis 


30 


33 


91 , 


, 7 


^ ij: o 
662 


1 1 


Qy iwwb 


Q91ww6 mus musculu 


31 


33 


91 . 


, / 


^ 1 o 

D 72 


c 


04 4 loy 


vj44ioy caenornaDcii 


32 


33 


91 , 


•"7 


^ o o 
boo 


LZ 


ft 1 TMT" /I 

Qy lUr 4 


Q91df4 polyomaviru 


33 


33 


91 , 


, / 


f o o 

boo 


12 


Qo JUCO 


Q8juc5 polyomaviru 


34 


33 


9 1 . 


. / 


CO Q 

boo 


12 


Qo JUDO 


Q8jud5 polyomaviru 


35 


33 


9 1 . 


. 7 


c a Q 

boo 


12 


Q8 JJ JO 


Q8jjj0 polyomaviru 


oo 




Q 1 

y 1 ^ 




Q Q 

boo 


1 o 

iz 


r\ ftr.TMft "7 


yywmu / poiyomaviru 


37 


33 


9 1 , 


, / 


^ o o 

boo 


12 


QoVbGb 


Q8v6g6 polyomaviru 


J 0 




^ i. < 




oo D 


1 9 


VO o v» u *1 


JVVJ*4 J. y ^JlUcl V X i. U 


39 


33 


91, 


,7 


688 


12 


Q9WM08 


Q9wm08 polyomaviru 


40 


33 


91. 


.7 


688 


12 


Q91DF8 


Q91df8 polyomaviru 


41 


33 


91. 


,7 


688 


12 


Q8JUD8 


Q8jud8 polyomaviru 


42 


33 


91. 


,7 


688 


12 


P88898 


P88898 polyomaviru 


43 


33 


91. 


.7 


688 


12 


Q8JUE2 


Q8jue2 polyomaviru 


44 


33 


91. 


.7 


688 


12 


Q91NL8 


Q91nl8 polyomaviru 


45 


33 


91. 


.7 


688 


12 


Q91DF5 


Q91df5 polyomaviru 



ALIGNMENTS 



RESULT 1 
Q64141 

ID Q64141 PRELIMINARY; PRT; 85 AA. 

AC Q64141; 

DT Ol-NOV-1996 (TrEMBLrel. 01, Created) 

DT Ol-NOV-1996 (TrEMBLrel. 01, Last sequence update) 

DT Ol-DEC-2001 (TrEMBLrel. 19, Last annotation update) 



DE T2 antigen (Fragment) . 

GN T2 ANTIGEN. 

OS Rattus sp. 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Rattus. 

OX NCBI_TaxID=10118; 

RN [1] 

RP SEQUENCE FROM N,A. 

RX MEDLINE=95347348; PubMed=7542587; 

RA Eul J./ Graessmann M. , Graessmann A.; 

RT "Experimental evidence for RNA trans-splicing in mammalian cells."; 

RL EMBO J. 14:3226-3235(1995). 

DR EMBL; S79053; AAB34943.1; 

DR InterPro; IPR003133; T_Ag_DNA_bind. 

DR Pfam; PF02217; T_Ag_DNA_bi nd; 1. 

FT NON__TER 8 5 85 

SQ SEQUENCE 85 AA; 9740 MW; 5E356405DD647198 CRC64; 

Query Match 100.0%; Score 36; DB 11; Length 85; 

Best Local Similarity 100.0%; Pred. No. 6.9; 

Matches 7; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 PKKKRKV 7 

I i I I I I I 
Db 18 PKKKRKV 24 



Search completed: November 12, 2003, 14:25:12 
Job time : 36 sees 



GenCore version 5.1,6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: November 12, 2003, 14:22:54 ; Search time 11 Seconds 

(without alignments) 
29.926 Million cell updates/sec 

Title: US-10-031-505-6 
Perfect score: 36 
Sequence: 1 PKKKRKV 7 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 127863 seqs, 47026705 residues 

Total number of hits satisfying chosen parameters: 127863 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : SwissProt_41 : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 


36 


100. 


0 


236 


1 


GT6 SCHMA 


P46435 


schistosoma 


2 


36 


100. 


0 


691 


1 


TALA_POVBA 


P14999 


polyoma viru 


3 


36 


100. 


0 


695 


1 


TALA_POVBK 


P03071 


polyomaviru 


4 


36 


100. 


0 


708 


1 


TALA_SV4 0 


P03070 


simian viru 


5 


33 


91. 


7 


384 


1 


YUYl^CAEEL 


P54073 


caenorhabdi 


6 


33 


91. 


7 


688 


1 


TALA_POVJC 


P03072 


polyomaviru 


7 


33 


91, 


7 


1083 


1 


T2D3 HUMAN 


000268 


homo sapien 


8 


33 


91, 


7 


1872 


1 


T2D1_HUMAN 


P21675 


homo sapien 


9 


32 


88. 


9 


91 


1 


RK23 MARPO 


P06390 


marchantia 


10 


32 


88. 


9 


115 


1 


IFIA PYRAB 


Q9vl38 


pyrococcus 


11 


32 


88. 


9 


115 


1 


IFIA PYRFU 


Q8u0k5 


pyrococcus 


12 


32 


88. 


9 


115 


1 


IF1A_PYRH0 


059280 


pyrococcus 


13 


32 


88. 


9 


124 


1 


WNT3__EVATR 


P28090 


evas terias 


14 


32 


88. 


9 


178 


1 


RPOE LISIN 


Q927t3 


listeria in 


15 


32 


88. 


9 


178 


1 


RPOE_LISMO 


Q8y494 


listeria mo 


16 


32 


88. 


9 


196 


1 


RAC2 LOTJA 


Q40220 


lotus japon 


17 


32 


88. 


9 


355 


1 


SUR6 MOUSE 


P70279 


mus musculu 



18 


32 


88 . 


9 


361 


1 


SURo HUMAN 


O / J Do o 


homo sapien 


19 


32 


88. 


9 


00 U 


i 




y yaoz 1 


inus iTiusculu 


20 


32 


88. 


9 


ODJ5 


1 


\/T~\CO VCA C"T* 




s a cchs roinyc 


21 


32 


88. 


9 


1517 


1 


RPOC CAMJE 


Q9pi30 


campylobact 


22 


32 


88. 


9 


2434 


1 


YCFl OENHO 


Q9mth5 


Oenothera h 


23 


31 


86. 


1 


501 


1 


GUAA THEMA 


Qyx^eu 


thermotoga 


24 


31 


86. 


1 


517 


1 
1 


GUAA LACRH 


Oo5iyz 


lactobacill 


25 


31 


86. 


1 


con 


1 


GUAA 5 1 H F o 


Qo K /eb 


s t reptococc 


26 


31 


86.1 


520 


1 


GUAA STRPY 


Q992 j 7 


streptococc 


27 


31 


86. 


1 


521 


1 


GUAA NEIMA 


^ ft ^ • -/"ft 
Q9 jwoO 


neisseria m 


28 


31 


86. 


1 


521 


1 


GUAA NEIMB 


Q9 jxr2 


neisseria m 


29 


31 


86. 


1 


coo 

522 


1 


KTRb YEAST 


n C o ft ^ ^ 

P539d5 


saccharomyc 


30 


31 


86. 


1 


524 


1 


GUAA YEAST 


P38625 


saccharomyc 


31 


31 


86.1 


960 


1 


K682 HUMAN 


Q9y4c8 


homo sapien 


32 


30 


83. 


3 


66 


1 


RL35 TREPA 


/-V ft O ft ft T 

083821 


treponema p 


33 


30 


83. 


3 


1 02 


1 


RK2i ODOSI 


r> x ft C C ft 

P4 9559 


odontella s 


34 


30 


83. 


3 


110 


1 


RK23_PORPU 


P51312 


porphyra pu 


35 


30 


83. 


3 


1 O ft 

129 


1 


r. T 1 TV ^T^m C 

RSll AGRTb 


Q8ue4 0 


agrobacteri 


36 


30 


83. 


3 


129 


1 


RSll BRUME 


Q8ynl7 


brucella me 


37 


30 


83. 


3 


129 


1 


RSll RHILO 


Q98n34 


rhizobium 1 


"3 Q 
OO 


o U 


83. 


3 




1 
1 


Del 1 DUTKyftr 


y y z ow / 


rhizobium m 


39 


30 


83. 


3 


130 


1 


RSll THEMA 


Q9xli4 


thermotoga 


40 


30 


83. 


3 


139 


1 


RS12_MYCPN 


P75546 


mycoplasma 


41 


30 


83. 


3 


158 


1 


SSRP BIFLO 


Q8g540 


bif idobacte 


42 


30 


83. 


3 


289 


1 


WR38__ARATH 


Q8gwf 1 


arabidopsis 


43 


30 


83. 


3 


341 


1 


C0A2_BFDV 


P13892 


budgerigar 


44 


30 


83. 


3 


353 


1 


RFC2_YEAST 


P40348 


saccharomyc 


45 


30 


83. 


3 


498 


1 


CIKW_DROME 


P17972 


drosophila 



ALIGNMENTS 



RESULT 1 




GT6_ 


SCHMA 




ID 


GT6 SCHMA STANDARD; PRT; 236 AA. 




AC 


P46435; 




DT 


Ol-NOV-1995 (Rel. 32, Created) 




DT 


Ol-NOV-1995 (Rel. 32, Last sequence update) 




DT 


16-OCT-2001 (Rel. 40, Last annotation update) 




DE 


Glutathione S-transf erase 6 (EC 2.5.1.18) (Fragment). 




GN 


GST- 6. 




OS 


Schistosoma mansoni (Blood fluke) . 




OC 


Eukaryota; Metazoa; Platyhelminthes ; Trematoda; Digenea; 


Strigeidida; 


OC 


Schistosomatoidea; Schistosomatidae; Schistosoma. 




OX 


NCBI TaxID=6183; 




RN 


[1] 




RP 


SEQUENCE FROM N.A. 




RA 


Vande Waa E.A., 0*Leary K.A., Kessler M.A. , Schuler L.A. 




RA 


Goes A.M., Mandal C, Doughty B.L., Tracy J.W. ; 




RL 


Submitted (XXX-1995) to the EMBL/GenBank/DDBJ databases. 




CC 


-!- CATALYTIC ACTIVITY: RX + glutathione = HX + R-S-glutathione , 


CC 


SIMILARITY; NONE TO THE GST SUPERFAMILY. 




cc 
cc 

cc 


SIMILARITY: STRONG, TO YEAST MAK16. 




This SWISS-PROT entry is copyright. It is produced through a collaboration 


cc 


between the Swiss Institute of Bioinf ormatics and the 


EMBL outstation - 



CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http : //www. isb-sib , ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; L06180; AAA29887.2; -. 

DR InterPro; IPR006958; Makl6. 

DR Pfam; PF04874; Makl6; 1. 

KW Transferase. 

FT NON^TER 1 1 

SQ SEQUENCE 236 AA; 28286 MW; 



55756FDDD6085023 CRC64; 



Query Match 100.0%; Score 36; DB 1; Length 236; 

Best Local Similarity 100.0%; Pred. No. 3.7; 

Matches 7; Conservative 0; Mismatches 0; Indels 



0; Gaps 



0; 



Qy 

Db 



1 PKKKRKV 7 

i I I I I I 1 
222 PKKKRKV 228 



Search completed: November 12, 2003, 14:25:35 
Job time : 12 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence: 

Scoring table: 



November 12, 2003, 14:22:54 ; Search time 21 Seconds 

(without alignments) 
32,056 Million cell updates/sec 

US-10-031-505-6 
36 

1 PKKKRKV 7 
BLOSUM62 

Gapop 10,0 , Gapext 0.5 



Searched: 283308 seqs, 96168682 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100-^ 
Listing first 45 summaries 



283308 



Database 



PIR__76:* 
pirl:^ 
pir2:^ 
pir3:^ 
pir4 : ^ 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



Result 
No. 


Score 


Query 
Match 


Length 


DB 


ID 


Description 


1 


36 


100.0 


350 


2 


JC7213 


Cre recombinase pr 


2 


36 


100.0 


691 


1 


TWPAS 


large T antigen - 


3 


36 


100.0 


695 


1 


TWPTB 


large T antigen - 


4 


36 


100.0 


708 


1 


TWPT4 


large T antigen - 


5 


35 


97.2 


278 


2 


T30450 


late expression fa 


6 


35 


97.2 


542 


2 


S20466 


hypothetical prote 


7 


33 


91.7 


213 


2 


E71190 


hypothetical prote 


8 


33 


91.7 


286 


2 


D84591 


hypothetical prote 


9 


33 


91.7 


361 


2 


T38005 


hypothetical prote 


10 


33 


91.7 


504 


2 


A85068 


U3 snoRNP-associat 


11 


33 


91.7 


672 


2 


T32557 


hypothetical prote 


12 


33 


91. 7 


688 


1 


TWPTJ 


large T antigen - 


13 


33 


91.7 


746 


2 


T19409 


hypothetical prote 



14 


33 


91.7 


754 


2 


B88252 


protein F44G4.1 [i 


15 


33 


91.7 


1865 


1 


148155 


transcription init 


16 


33 


91.7 


1893 


1 


A4 02 62 


transcription init 


17 


32 


88.9 


91 


1 


R5LV23 


ribosomal protein 


18 


32 


88.9 


115 


2 


B75179 


translation initia 


19 


32 


88.9 


116 


2 


H71034 


probable translati 


20 


32 


88.9 


132 


2 


S43488 


homeotic protein L 


21 


32 


88.9 


137 


2 


PC1287 


homeotic protein n 


22 


32 


88.9 


140 


2 


T43768 


ribosomal protein 


23 


32 


88. 9 


148 


2 


S21401 


homeotic protein - 


24 


32 


88.9 


178 


2 


AC1770 


B. subtilis RNA po 


25 


32 


88.9 


178 


2 


AH1394 


B. subtilis RNA po 


26 


32 


88. 9 


196 


2 


C95322 


hypothetical prote 


27 


32 


88.9 


268 


2 


T32740 


hypothetical prote 


28 


32 


88.9 


426 


2 


T16340 


hypothetical prote 


29 


32 


88.9 


434 


2 


E96826 


hypothetical prote 


30 


32 


88.9 


438 


2 


T47861 


nucleolar autoanti 


31 


32 


88. 9 


629 


2 


A69814 


ABC transporter (A 


32 


32 


88 , 9 


663 


2 


S69626 


hypothetical prote 


33 


32 


88.9 


813 


2 


E85135 


hypothetical prote 


34 


32 


88.9 


1435 


2 


S69632 


regulatory protein 


35 


32 


88.9 


1517 


2 


B81393 


DNA-directed RNA p 


36 


32 


88 . 9 


2241 


2 


T16064 


hypothetical prote 


37 


31 


86. 1 


139 


2 


T03184 


probable high-raobi 


38 


31 


86.1 


365 


2 


A96524 


F11A17.6 [imported 


39 


31 


86.1 


367 


2 


T18839 


hypothetical prote 


40 


31 


86.1 


395 


2 


T50804 


hypothetical prote 


41 


31 


86.1 


501 


2 


G72206 


GMP synthase - The 


42 


31 


86.1 


513 


2 


B96524 


hypothetical prote 


43 


31 


86. 1 


521 


2 


G81971 


probable GMP synth 


44 


31 


86.1 


521 


2 


D81026 


GMP synthase NMB19 


45 


31 


86.1 


522 


2 


S62941 


probable membrane 



ALIGNMENTS 



RESULT 1 
JC7213 

Cre recombinase protein - Chinese hamster 
C;Species: Cricetulus griseus (Chinese hamster) 

C;Date: 09-Jun-2000 #sequence_revision 09-Jun-2000 #text_change 21-Jul-2000 
C;Accession: JC7213 

R;Koresawa, Y.; Miyagawa, S.; Ikawa, M, ; Matsunami, K. ; Yamada, M. ; Shirakura, 
R. ; Okabe, M. 

J. Biochem. 127, 361-312, 2000 

A;Title: Synthesis of a new Cre recombinase gene based on optimal codon usage 
for mammalian systems. 

A;Reference number: JC7213; MUID : 2019824 8 ; PMID: 10731707 
A/Accession: JC7213 
A; Molecule type: mRNA 
A; Residues: 1-350 <KOR> 

A; Experimental source: ovarian tumor cells 
C; Superf amily: phage PI recombinase 



Query Match 100.0%; Score 36; DB 2; Length 350; 

Best Local Similarity 100.0%; Pred. No. 23; 



Matches 7; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 PKKKRKV 7 

I I 1 I I I I 
Db 2 PKKKRKV 8 

Search completed: November 12, 2003, 14:24:24 
Job time : 22 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: November 12, 2003, 14:22:54 



; Search time 41 Seconds 
(without alignments) 
27.100 Million cell updates/sec 



Title: 

Perfect score: 
Sequence: 

Scoring table: 



Searched: 



US-10-031-505-6 
36 

1 PKKKRKV 7 



BLOSUM62 
Gapop 10.0 



Gapext 0.5 



1107863 seqs, 158726573 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



1107863 



Database : 



A Geneseq_19Jun03 : * 



1 

2 
3 
4 

5 
6 
7 
8 
9 

10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 
23 
24 



/SIDSl/gcgdata/geneseq/ geneseqp-embl/AAl980 . 

/SIDSl/gcgdata/geneseq/geneseqp-embl/AAl98 1 . 

/SIDSl/gcgdata/geneseq/geneseqp-embl/AA1982 , 

/SIDSl/gcgdata/geneseq/geneseqp-embl/AA1983. 

/SIDSl/gcgdata/geneseq/geneseqp-embl/AA198 4 . 

/SIDSl/gcgdata/geneseq/geneseqp-embl/AAl98 5 . 

/SIDSl/gcgdata/geneseq/geneseqp-embl/AA198 6 . 

/SIDSl/gcgdata/geneseq/geneseqp-embl/AA1987 . 

/ SIDSl/gcgdata/geneseq/ gene seqp-embl/AAl 98 8 . 
/SIDSl/gcgdata/geneseq/geneseqp-embl/AA1989 
/SIDSl/gcgdata/geneseq/geneseqp-embl/AA1990 
/SIDSl/gcgdata/geneseq/geneseqp-embl/AA1991 
/SIDSl/gcgdata/geneseq/geneseqp-embl/AA1992 
/SIDSl/gcgdata/geneseq/geneseqp-embl/AA1993 
/SIDSl/gcgdata/geneseq/geneseqp-embl/AA1994 
/ SIDSl/gcgdata/geneseq/geneseqp-embl/AA1995 
/SIDSl/gcgdata/geneseq/geneseqp-embl/AA1996 
/SIDSl/gcgdata/geneseq/geneseqp-embl/AA1997 
/SIDSl/gcgdata/geneseq/geneseqp-embl/AA1998 
/ SIDSl/gcgdata/geneseq/ geneseqp-embl/AA1999 
/SIDSl/gcgdata/geneseq/geneseqp-embl/AA2000 
/ SI DSl/gcgdata /genes eq/geneseqp-embl/AA2 001 
/ SI DSl/gcgdata /genes eq/geneseqp-embl/AA2 002 
/SIDSl/gcgdata/geneseq/geneseqp-embl/AA2003 



DAT:* 
DAT: * 
DAT: * 
DAT:* 
DAT: * 
DAT:* 
DAT: * 
DAT:* 
DAT:* 
. DAT : * 
. DAT : * 
. DAT : * 
. DAT : * 
. DAT : * 
. DAT : * 
. DAT : * 
. DAT : * 
. DAT : * 
. DAT : * 
. DAT : * 
. DAT : * 
. DAT : * 
. DAT : * 
. DAT : * 



Fred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed. 



and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 



No. 


Score 


Match 


Length 


DB 


ID 


Description 


1 


36 


100. 0 


7 


15 


AAR48253 


Subcellular locaii 


2 


36 


100. 0 


7 


16 


AAW11495 


SV 40 large T anti 


3 


36 


100. 0 


7 


16 


AAR80531 


SV4 0 derived nucle 


4 


36 


100. 0 


7 


16 


AAR804S8 


SV40 virus large a 


5 


36 


100.0 


7 


16 


AAR7 6591 


Nuclear localisati 


6 


36 


100.0 


7 


17 


AAW02625 


SV40 large T antig 


7 


36 


100.0 


7 


18 


AAW21930 


SV40 large T-antig 


8 


36 


100. 0 


7 


18 


AAW18265 


SV4 0 T-antigen nuc 


9 


36 


100. 0 


7 


18 


AAW19795 


SV40 virus nuclear 


10 


36 


100 ♦ 0 


7 


18 


AAW14428 


SV40 Large T antig 


11 


36 


100. 0 


7 


19 


ABB7904 6 


SV40 large T antig 


12 


36 


100. 0 


7 


19 


AAW64765 


SV40 nuclear local 


13 


36 


100.0 


7 


19 


AAW37985 


SV40-antigen local 


14 


36 


100. 0 


7 


19 


AAW56400 


Nuclear localisati 


15 


36 


100. 0 


7 


19 


AAW56418 


Nuclear localisati 


16 


36 


100.0 


7 


19 


AAW50657 


SV 40 large T anti 


17 


36 


100. 0 


7 


19 


AAW53468 


SV40 large T antig 


18 


36 


100, 0 


7 


19 


AAW4164 6 


Peptide used in pr 


19 


36 


100. 0 


7 


19 


AAW44276 


Human thymosin bet 


20 


36 


100.0 


7 


20 


AAY6719 5 


SV40 T antigen nuc 


21 


36 


100. 0 


7 


20 


AAY42289 


Simian virus 4 0 (S 


22 


36 


100.0 


7 


20 


AAY27394 


SV40 large T antig 


23 


36 


100.0 


7 


20 


AAY27401 


SV40 nuclear local 


24 


36 


100. 0 


7 


20 


AAY33115 


Huntingdon's chore 


25 


36 


100.0 


7 


20 


AAY2 4 432 


SV40 nuclear local 


26 


36 


100 . 0 


7 


20 


AAY08 4 08 


SV40 nuclear local 


27 


36 


100. 0 


7 


20 


AAW96350 


Nucleus localisati 


28 


36 


100.0 


7 


20 


AAW92809 


EP-893493 Seq ID 3 


29 


36 


100.0 


7 


20 


AAW95380 


Nuclear localisati 


30 


36 


100 . 0 


7 


21 


AAB30402 


Nuclear localisati 


31 


36 


100. 0 


7 


21 


AAB14267 


Nuclear localisati 


32 


36 


100. 0 


7 


21 


AAB15405 


SV40 large T antig 


33 


io 


iUU . u 


/ 


Z 1 


AAB 1 U 4 z 1 


SV40 large T antig 


34 


36 


100.0 


7 


21 


AAY88067 


SV40 large T antig 


35 


36 


100.0 


7 


21 


AAY90961 


SV40 large T cell 


36 


36 


100.0 


7 


21 


AAY90468 


SV40 large T antig 


37 


36 


100.0 


7 


21 


AAY81405 


SV4 0 T antigen nuc 


38 


36 


100.0 


7 


21 


AAY58921 


SV40 large T antig 


39 


36 


100.0 


7 


21 


AAY54042 


Nuclear localisati 


40 


36 


100.0 


7 


21 


AAY55821 


SV40 large T antig 


41 


36 


100.0 


7 


21 


AAY43812 


SV40 large T antig 


42 


36 


100.0 


7 


22 


ABG99123 


SV40 large T antig 


43 


36 


100.0 


7 


22 


AAM52169 


SV4 0 nuclear local 


44 


36 


100.0 


7 


22 


AAE08726 


SV40 large T antig 


45 


36 


100.0 


7 


22 


AAU05119 


HIV-1 integrase nu 



ALIGNMENTS 



RESULT 1 
AAR48253 

ID AAR48253 standard; peptide; 7 AA. 
XX 

AC AAR4 8253; 
XX 

DT 25-MAR-2003 (updated) 

DT 29-JUL-1994 (first entry) 

XX 

DE Subcellular localisation signal PKKKRKV. 
XX 

KW Single chain antibody; sFv; heavy chain; light chain; kappa; 

KW variable domain; hydrophilic linker; antibodies; organelle; 

KW endoplasmic reticulum retention; nuclear retention signal. 

XX 

OS Synthetic. 
XX 

PN WO9402610-A1. 

XX 

PD 03-FEB-1994. 
XX 

PF 16-JUL-1993; 93WO-US06735 • 
XX 

PR 17-JUL-1992; 92US-0916939 . 

PR 17-MAR-1993; 93US-0045274 . 
XX 

PA (DAND ) DANA FARBER CANCER INST INC. 
XX 

PI Haseltine WA, Marasco WA; 
XX 

DR WPI; 1994-048868/06. 
XX 

PT Intracellular binding of antigens - by using antibody targetting 

PT with vector system, for e.g. tumour suppression 

XX 

PS Claim 32; Page 101; 155pp; English. 
XX 

CC New vector systems comprise a sequence adapted for intracellular 

CC delivery and expression contg. a promoter operably linked to an 

CC antibody gene encoding an antibody which binds to a specific target 

CC antigen. The antibody is esp. a single chain antibody in which the 

CC heavy and light chain variable regions are joined via a hydrophilic 

CC linker peptide. Localisation sequences are pref. included in the 

CC constructs. See AAR48246-9 and AAR48252-3 for pref. (known) endoplasmic 

CC reticulum retention signals. N.B. the sequence AAR48253 is described 

CC as an ER retention signal in the Claims but as a nuclear retention 

CC signal in the disclosure. 

CC {Updated on 25-MAR-2003 to correct PN field.) 
XX 

SQ Sequence 7 AA; 

Query Match 100.0%; Score 36; DB 15; Length 7; 

Best Local Similarity 100.0%; Pred. No. 9.3e+05; 

Matches 7; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 PKKKRKV 7 

1 I I I I I I 



Db 1 PKKKRKV 7 



RESULT 2 
AAW11495 

ID AAW11495 standard; peptide; 7 AA. 
XX 

AC AAW11495; 
XX 

DT 02-APR-1997 (first entry) 
XX 

DE SV 40 large T antigen nuclear localisation sequence. 
XX 

KW Simian virus 40; SV40; large T antigen; presentation; eukaryotic; 

KW nuclear localisation sequence; cell nucleus; polynucleotide; 

KW gramicidin S; gene therapy; delivery; high; trans fection; 

KW efficiency. 

XX 

OS Rhesus macaque polyoma virus. 
XX 

PN WO9502397-A1. 
XX 

PD 26-JAN-1995. 
XX 

PF 14-JUL-1994; 94WO-US07916 . 
XX 

PR 14-JUL-1993; 93US-0092200 . 
XX 

PA (REGC ) UNIV CALIFORNIA, 
XX 

PI Haensler Szoka FC; 
XX 

DR WPI; 1995-067149/09. 
XX 

PT Compsns . for introducing polynucleotides into cells - comprising a 

PT dendrimer polycation operatively coupled to the polynucleotide, 

PT useful e.g. for gene therapy 
XX 

PS Disclosure; Page 28; 89pp; English. 
XX 

CC The present peptide is a simian virus 40 large T antigen derived 

CC nuclear localisation sequence (NLS) , which can be used to present a 

CC polynucleotide (PN) to an eukaryotic cell nucleus. A NLS/PN 

CC composition, which may further comprise a membrane permeabilising 

CC agent (e.g. gramicidin S) , an intercalating agent and a DNA masking 

CC agent, is useful in gene therapy and delivers the PN with high 

CC transfection efficiency. 

XX 

SQ Sequence 7 AA; 

Query Match 100.0%; Score 36; DB 16; Length 7; 
Best Local Similarity 100. 0?^; Pred. No. 9.3e+05; 

Matches 7; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 PKKKRKV 7 

I I I I I I I 

Db 1 PKKKRKV 7 



RESULT 3 
AAR80531 

ID AAR80531 standard; Peptide; 7 AA. 
XX 

AC AAR80531; 
XX 

DT 02-APR-1996 (first entry) 
XX 

DE SV40 derived nuclear localisation sequence #1. 
XX 

KW Moloney murine leukemia virus; gene therapy; retrovirus vector; 

KW nuclear localisation signal sequence. 

XX 

OS SV4 0. 
XX 

FN W09519368-A1. 

XX 

PD 20-JUL-1995. 
XX 

PF 12-JAN-1995; 95WO-US00129 . 
XX 

PR 14-JAN-1994; 94US-0181335 . 
XX 

PA (ALEX-) ALEXION PHARM INC. 
XX 

PI Fidel SA, Kennedy SP, Mason JM; 
XX 

DR WPI; 1995-263825/34. 
XX 

PT Retroviral vector particles comprising an onco-retroviral gag capsid 

PT protein - are used to transduce non-proliferating cells. 

XX 

PS Disclosure; Page 29; 61pp; English. 
XX 

CC This sequence represents the nuclear localisation signal from the 

CC SV40 large T antigen. This peptide may be used in the production of 

CC a mutated MoMLV oncoretroviral gag capsid protein. Retrovirus vectors 

CC comprising the mutated gag capsid protein can be used to transfer 

CC exogenous genes, preferably encoding therapeutic proteins, to non- 

CC proliferating cells including stem cells and neurons. The presence 

CC of the NLS allows the exogenous gene to enter into the nucleus of the 

CC target cell, thus allowing integration of the gene into the target cell 

CC genome. The retroviral vector can be used for ex vivo gene therapy of 

CC a broad range of inherited and acquired diseases. 
XX 

SQ Sequence 7 AA; 

Query Match 100.0%; Score 36; DB 16; Length 7; 
Best Local Similarity 100.0%; Pred. No. 9.3e+05; 

Matches 7; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 PKKKRKV 7 

I M I I I I 

Db 1 PKKKRKV 7 



RESULT 14 
AAW56400 

ID AAW56400 standard; peptide; 7 AA. 
XX 

AC AAW56400; 
XX 

DT 05-AUG-1998 (first entry) 
XX 

DE Nuclear localisation signal (NLS) . 
XX 

KW Signal peptide; nuclear localisation signal; NLS; 

KW irmnunosuppressive activity; inhibition; nuclear translocation inhibitor; 

KW nuclear translocation; treatment; immune disorder; autoimmune disease; 

KW hypersensitivity; sepsis; prevention; septic shock; antiviral agent; 

KW tumour growth suppressor. 
XX 

OS Unidentified. 
XX 

PN WO9811907-A. 
XX 

PD 2 6-MAR-1998. 

XX 

PF 15-SEP-1997; 97WO-US16217 . 
XX 

PR 12-SEP-1997; 97US-0928958 . 

PR 20-SEP-1996; 96US-0026978 . 
XX 

PA (BRIM ) BRISTOL-MYERS SQUIBB CO. 
XX 

PI Blake J, Cleaveland JS, Haffar OK, Nadler SG; 
XX 

DR WPI; 1998-217028/19. 
XX 

PT Nuclear translocation inhibitor polypeptides - comprising signal 

FT sequence for delivery through the cytoplasmic membrane and at least 

PT 2 nuclear localisation sequences 
XX 

PS Claim 10; Page 44; 69pp; English. 
XX 

CC Peptides AAW56400-13 represent nuclear localisation signals (NLSs) . The 

CC sequence is used to construct the nuclear translocation inhibitor 

CC polypeptide of the invention. Nuclear translocation inhibitor 

CC polypeptides comprise a signal sequence peptide capable of delivering 

CC the polypeptide through the cytoplasmic membrane into a cell, and at 

CC least 2 NLSs. The polypeptides can be used to inhibit nuclear 

CC translocation of a cellular protein. In addition, since the nuclear 

CC translocation of certain cellular peptides is required for the host 

CC organism to mount an immune response, the polypeptide inhibitors are 

CC useful as immunosuppression agents. The polypeptides can therefore be 

CC used for the treatment of immune disorders including autoimmune 

CC diseases. The polypeptides can also be used for treating physical 

CC symptoms manifested by responses to allergens which can initiate a state 

CC of hypersensitivity, for the treatment of sepsis and in the prevention 

CC of septic shock, antiviral agents, tumour growth suppressors, and for 

CC transcriptionally modulating the expression of cellular genes. 



XX 

SQ Sequence 7 AA; 



Query Match 100.0%; Score 36; DB 19; Length 7; 

Best Local Similarity 100.0%; Pred. No. 9.3e+05; 

Matches 7; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 PKKKRKV 7 

I I I I I I I 
Db 1 PKKKRKV 7 



RESULT 4 0 
AAY55821 

ID AAY55821 standard; peptide; 7 AA. 
XX 

AC AAYS5821; 
XX 

DT 06-MAR-2000 (first entry) 
XX 

DE SV4 0 large T antigen NLS sequence. 
XX 

KW Cellular protein; nuclear translocation; nuclear localization signal; 

KW iminunosuppressant ; immune response; viral infection; immune disorder; 

KW rheumatoid arthritis; multiple sclerosis; juvenile-onset diabetes; NLS; 

KW systemic lupus erytematosus ; SLE; asthma; sepsis; tumor growth; SV40; 

KW fibroblast growth factor; large T antigen. 
XX 

OS Rhesus macaque polyoma virus. 
XX 

PN W09957138-A1. 
XX 

PD ll-NOV-1999. 
XX 

PF 26-APR-1999; 99WO-US08984 . 
XX 

PR 04-MAY-1998; 98US-0072429 . 
XX 

PA (BRIM ) BRISTOL-MYERS SQUIBB CO. 
XX 

PI Nadler SG; 
XX 

DR WPI; 2000-062141/05. 
XX 

PT Novel composition comprising a nuclear localization signal and an 

PT inhibitor of nuclear translocation, useful for preventing transplanted 

PT organ or tissue rejection - 

XX 

PS Claim 8; Page 42; 68pp; English. 
XX 

CC The invention provides a composition comprising an isolated polypeptide 

CC (I) (polypeptide inhibitor of a cellular protein nuclear translocation) 

CC which comprises a signal sequence peptide capable of delivering (I) 

CC through a cytoplasmic membrane into the cell; and a nuclear localization 

CC signal sequence (NLS) present along with an immunosuppressant. (I) or 

CC its derivatives provide useful tools for introducing an exogenous 

CC polypeptide comprising an NLS into an intact cell to inhibit nuclear 



CC translocation of a cellular protein, for studying the role of nuclear 

CC translocation in the regulation of cellular processes. See AAY55812 fo 

CC detailed uses of (I) and compositions contianing (I). The present 

CC sequence represents a NLS sequence from SV40 large T antigen. 
XX 

SQ Sequence 7 AA; 

Query Match 100.0%; Score 36; DB 21; Length 7; 

Best Local Similarity 100.0%; Pred. No. 9.3e+05; 

Matches 7; Conservative 0; Mismatches 0; Indels 0; Gaps 

Qy 1 PKKKRKV 7 

I I I I I I I 
Db 1 PKKKRKV 7 



Search completed: November 12, 2003, 14:23:50 
Job time : 42 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd, 



OM protein - protein search, using sw model 



Run on: November 12, 2003, 14:22:59 ; Search time 22 Seconds 

(without alignments) 
13.463 Million cell updates/sec 



Title: 

Perfect score : 
Sequence: 



US-10-031-505-6 
36 

1 PKKKRKV 7 



Scoring table; 



BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 



328717 seqs, 42310858 residues 



Total number of hits satisfying chosen parameters: 



328717 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : Issued_Patents_AA: * 

1: /cgn2_6/ptodata/l/iaa/5A_COMB.pep:* 

2 : /cgn2_6/ptodata/l/iaa/5B_COMB.pep: * 

3: /cgn2_6/ptodata/l/iaa/6A_COMB.pep: * 

4 : /cgn2_6/ptodata/l/iaa/6B_COMB.pep: * 

5: /cgn2_6/ptodata/l/iaa/PCTUS__COMB.pep: * 

6: /cgn2_6/ptodata/l/iaa/backf ilesl .pep: * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 


36 


100. 


0 


7 


1 


US-08-347-792-14 


Sequence 


14, Appl 


2 


36 


100. 


0 


7 


1 


US-08-182-612B-1 


Sequence 


1, Appii 


3 


36 


100. 


0 


7 


1 


US-08-181-335B-1 


Sequence 


1, Appli 


4 


36 


100. 


0 


7 


1 


US-08-480-463-1 


Sequence 


1, Appli 


5 


36 


100. 


0 


7 


1 


US-08-664-856A-4 


Sequence 


4, Appli 


6 


36 


100. 


0 


7 


1 


US-08-240-514-3 


Sequence 


3, Appli 


7 


36 


100. 


0 


7 


1 


US-08-486-421-33 


Sequence 


33, Appl 


8 


36 


100. 


0 


7 


1 


US-08-801-796-4 


Sequence 


4, Appli 


9 


36 


100. 


0 


7 


1 


US-08-431-357-14 


Sequence 


14, Appl 


10 


36 


100. 


0 


7 


1 


US-08-470-911-33 


Sequence 


33, Appl 


11 


36 


100. 


0 


7 


2 


US-08-612-302A--3 


Sequence 


3, Appli 



12 


36 


100. 


0 


7 


2 


US- 


•08-931-877-4 


Sequence 


4/ Appli 


13 


36 


100. 


0 


7 


2 


us- 


08-373-190-22 


Sequence 


22, Appl 


14 


36 


100 . 


0 


7 


2 


us- 


08-281-423-13 


Sequence 


13, Appl 


15 


36 


100. 


0 


7 


2 


us- 


08-486-809-33 


Sequence 


33, Appl 


16 


36 


100. 


0 


7 


2 


us- 


08-928-958-10 


Sequence 


10, Appl 


17 


36 


100. 


0 


7 


2 


as- 


08-480-445A-1 


Sequence 


1, Appli 


18 


36 


100 . 


0 


7 


2 


us- 


•09-072-429-10 


Sequence 


10, Appl 


19 


36 


100. 


0 


7 


2 


us- 


-08-438-190A-22 


Sequence 


22, Appl 


20 


36 


100. 


0 


7 


2 


us- 


08-491-988-1 


Sequence 


1, Appli 


21 


36 


100 . 


0 


7 


2 


us- 


08-822-701-12 


Sequence 


12, Appl 


22 


36 


100. 


0 


7 


2 


us- 


08-480-446-1 


Sequence 


1, Appli 


23 


36 


100. 


0 


7 


3 


us- 


08-350-215-22 


Sequence 


22, Appl 


24 


36 


100. 


0 


7 


3 


us- 


08-487-799-82 


Sequence 


82, Appl 


25 


36 


100. 


0 


7 


3 


us- 


09-139-762A-125 


Sequence 


125, App 


26 


36 


100. 


0 


7 


3 


us- 


08-920-610-8 


Sequence 


8, Appli 


27 


36 


100. 


0 


7 


3 


us- 


09-069-484-4 


Sequence 


4, Appli 


28 


36 


100. 


0 


7 


3 


us- 


09-039-555B-9 


Sequence 


9, Appli 


29 


36 


100.0 


7 


3 


us- 


08-874-825-20 


Sequence 


20, Appl 


30 


36 


100. 


0 


7 


3 


us- 


08-935-855-12 


Sequence 


12, Appl 


31 


36 


100. 


0 


7 


3 


us- 


09-287-145A-22 


Sequence 


22, Appl 


32 


36 


100. 


0 


7 


3 


us- 


08-663-824-20 


Sequence 


20, Appl 


33 


36 


100. 


0 


7 


3 


us- 


09-007-020-1 


Sequence 


1, Appli 


34 


36 


100. 


0 


7 


3 


us- 


08-469-433B-1 


Sequence 


1, Appli 


35 


36 


100. 


0 


7 


3 


us- 


09-140-149-22 


Sequence 


22, Appl 


36 


36 


100. 


0 


7 


3 


us- 


08-789-333F-7 


Sequence 


7, Appli 


37 


36 


100. 


0 


7 


3 


us- 


09-169-015-17 


Sequence 


17, Appl 


38 


3 6 


100. 


0 


7 


3 


us- 


09-0o5-30d-3U 


Sequence 


30, Appl 


39 


36 


100. 


0 


7 


3 


us- 


09-242-131A-4 


Sequence 


4, Appli 


40 


36 


100. 


0 


7 


3 


us- 


08-850-744-11 


Sequence 


11, Appl 


41 


36 


100. 


0 


7 


3 


us- 


09-133-944-7 


Sequence 


7, Appli 


42 


36 


100. 


0 


7 


4 


us- 


09-615-283-4 


Sequence 


4, Appli 


43 


36 


100. 


0 


7 


4 


us- 


08-469-899A-1 


Sequence 


1, Appli 


44 


36 


100. 


0 


7 


4 


us- 


09-369-744-4 


Sequence 


4, Appli 


45 


36 


100. 


0 


7 


4 


us- 


09-411-706-3 


Sequence 


3, Appli 



ALIGNMENTS 



RESULT 1 

US-08-347-792-14 

; Sequence 14, Application US/08347792 
; Patent No. 5573925 
; GENERAL INFORMATION: 

; APPLICANT: Halazonetis, Thanos D. 

; TITLE OF INVENTION: p53 Proteins With Altered 

TITLE OF INVENTION: Tetrameri zation Domains 

NUMBER OF SEQUENCES: 37 
; CORRESPONDENCE ADDRESS: 
; ADDRESSEE: Howson and Howson 

; STREET: Spring House Corporate Cntr., PO Box 457 

. CITY: Spring House 

; STATE: Pennsylvania 

; COUNTRY: USA 

ZIP: 19477 
COMPUTER READABLE FORM: 
; MEDIUM TYPE: Floppy disk 



; COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 
; SOFTWARE: Patentin Release #1,0, Version SI. 25 

CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/347,792 
FILING DATE: 
CLASSIFICATION: 530 
ATTORNEY/AGENT INFORMATION: 
NAME: Bak, Mary E. 
; REGISTRATION NUMBER: 31,215 

REFERENCE/DOCKET NUMBER: WST58USA 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 215-540-9206 
TELEFAX: 215-540-5818 
; INFORMATION FOR SEQ ID NO: 14: 

SEQUENCE CHARACTERISTICS: 
; LENGTH: 7 amino acids 

; TYPE: amino acid 

TOPOLOGY: linear 
MOLECULE TYPE: peptide 
US-08-347-792-14 

Query Match 100.0%; Score 36; DB 1; Length 7; 

Best Local Similarity 100.0%; Pred. No. 2.5e+05; 
Matches 7; Conservative 0; Mismatches 0; Indels 

Qy 1 PKKKRKV 7 

I I I M I I 
Db 1 PKKKRKV 7 



RESULT 4 
US-08-480-463-1 

; Sequence 1, Application US/08480463 

; Patent No. 5661025 

; GENERAL INFORMATION: 

; APPLICANT: SZOKA, FRANCIS C. 

; APPLICANT: HENSLER, JEAN 

TITLE OF INVENTION: SELF-ASSEMBLING POLYNUCLEOTIDE 
; TITLE OF INVENTION: DELIVERY SYSTEM COMPRISING DENDRIMER 
; TITLE OF INVENTION: POLYCATIONS 
; NUMBER OF SEQUENCES: 11 
; CORRESPONDENCE ADDRESS: 

ADDRESSEE: CROSBY HEAFEY ROACH & MAY 

STREET: 1999 HARRISON ST 

CITY: OAKLAND 

STATE: CA 

COUNTRY: USA 
; ZIP: 94612 

COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
; COMPUTER: IBM PC compatible 

; OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: WordPerfect 5.1/ASCII 
CURRENT APPLICATION DATA: 
; APPLICATION NUMBER: US/08/4 80,463 



; FILING DATE: 07-JUN-1995 

CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 
; APPLICATION NUMBER: US 08/092,200 

FILING DATE: 14-JUL-1993 
; ATTORNEY/ AGENT INFORMATION: 

NAME: KOENIG, NATHAN P. 

REGISTRATION NUMBER: 38,210 

REFERENCE/ DOCKET NUMBER: 13054.00110 
TELECOMMUNICATION INFORMATION: 
; TELEPHONE: (510) 763-2000 

TELEFAX: (510) 273-8832 
; TELEX: 

; INFORMATION FOR SEQ ID NO: 1: 

SEQUENCE CHARACTERISTICS: 
; LENGTH: 7 amino acids 

; TYPE: amino acid 

; STRANDEDNESS : single 

TOPOLOGY: linear 
US-08-480-463-1 

Query Match 100.0%; Score 36; DB 1; Length 7; 

Best Local Similarity 100.0%; Pred. No. 2.5e+05; 
Matches 7; Conservative 0; Mismatches 0; Indels 

Qy 1 PKKKRKV 7 

I I I I I I 1 
Db 1 PKKKRKV 7 



RESULT 6 
US-08-240-514-3 

Sequence 3, Application US/08240514 
Patent No. 5670347 

GENERAL INFORMATION: 

APPLICANT: GOPAL, T. Venkat 

TITLE OF INVENTION: PEPTIDE-MEDIATED GENE TRANSFER 
NUMBER OF SEQUENCES: 56 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Foley & Lardner 
STREET: 3000 K Street, N.W., Suite 500 
CITY: Washington 
STATE: D.C. 
COUNTRY: USA 
ZIP: 20007-5109 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: PatentIn Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 08/24 0, 514 
FILING DATE: ll-MAY-1994 
CLASSIFICATION: 435 
ATTORNEY/AGENT INFORMATION: 
NAME: BENT, Stephen A. 
REGISTRATION NUMBER: 29,768 



REFERENCE/DOCKET NUMBER; 7352 1/102/CLIN 
TELECOMMUNICATION INFORMATION: 
; TELEPHONE: (202)672-5300 

TELEFAX: (202)672-5399 
TELEX: 904136 
; INFORMATION FOR SEQ ID NO: 3; 

SEQUENCE CHARACTERISTICS: 
; LENGTH: 7 amino acids 

; TYPE: amino acid 

; STRANDEDNESS : single 

; TOPOLOGY: linear 

US-08-240-514'3 

Query Match 100.0%; Score 36; DB 1; Length 7; 

Best Local Similarity 100,0%; Pred. No. 2.5e+05; 
Matches 7; Conservative 0; Mismatches 0; Indels 

Qy 1 PKKKRKV 7 

I I I I I I I 
Db 1 PKKKRKV 7 



RESULT 19 
US-08-438-190A-22 

; Sequence 22, Application US/08438190A 
; Patent No. 5965371 
; GENERAL INFORMATION: 

APPLICJ\NT: MARASCO, WAYNE 

APPLICANT: HASELTINE, WILLIAM 

TITLE OF INVENTION: METHOD OF INTRACELLULAR BINDING OF 
TITLE OF INVENTION: PROTEINS 
NUMBER OF SEQUENCES: 78 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: DAVID G. CONLIN; DIKE, BRONSTEIN, ROBERTS & 
ADDRESSEE: CUSHMAN 
STREET: 130 WATER STREET 
; CITY: BOSTON 

STATE: MASSACHUSETTS 
COUNTRY: US 
ZIP: 02109 
COMPUTER READABLE FORM: 
; MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
; SOFTWARE: Patentin Release #1,0, Version #1.25 

CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/438 , 190A 
; FILING DATE: 

; CLASSIFICATION: 435 

; ATTORNEY/AGENT INFORMATION: 

NAME: EISENSTEIN, RONALD I. 
REGISTRATION NUMBER: 30628 
; REFERENCE/ DOCKET NUMBER: 41956 

; TELECOMMUNICATION INFORMATION: 
TELEPHONE: (617) 523-3400 
TELEFAX: (617) 523-6440 



TELEX: 200291 STRE UR 
; INFORMATION FOR SEQ ID NO: 22: 
; SEQUENCE CHARACTERISTICS: 
; LENGTH: 7 amino acids 

; TYPE: amino acid 

; TOPOLOGY: linear 

US-08-438-190A-22 

Query Match 100.0%; Score 36; DB 2; Length 7 

Best Local Similarity 100.0%; Pred. No. 2.5e+05; 
Matches 7; Conservative 0; Mismatches 0; Indels 

Qy 1 PKKKRKV 7 

I I I I I I I 
Db 1 PKKKRKV 7 



RESULT 20 

US-08-491-988-1 

; Sequence 1, Application US/08491988 
; Patent No. 5973116 
; GENERAL INFORMATION: 

APPLICANT: EPENETOS, AGAMEMNON A. 
APPLICANT: SPOONER, ROBERT A. 
APPLICANT: DEONARAIN, MAHENDRA 
TITLE OF INVENTION: Compounds for targeting 
; NUMBER OF SEQUENCES: 29 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: McAULAY NISSEN GOLDBERG KIEL & HAND, LLP 
STREET: 261 MADISON AVENUE 
CITY: NEW YORK 
; STATE: NY 

COUNTRY: USA 
ZIP: 10016-2391 
; COMPUTER READABLE FORM: 

; MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
; SOFTWARE: Patentin Release #1.0, Version #1.25 

; CURRENT APPLICATION DATA: 
; APPLICATION NUMBER: US/08/491,988 

FILING DATE: 18-DEC-1995 
CLASSIFICATION: 424 
ATTORNEY/AGENT INFORMATION: 
NAME: GOLDBERG, JULES E. 
; REGISTRATION NUMBER: 24,408 

; TELECOMMUNICATION INFORMATION: 
TELEPHONE: 212-986-4090 
TELEFAX: 212-818-9479 
; INFORMATION FOR SEQ ID NO: 1: 

SEQUENCE CHARACTERISTICS: 
; LENGTH: 7 amino acids 

; TYPE: amino acid 

; STRANDEDNESS: single 

; TOPOLOGY: linear 

MOLECULE TYPE: peptide 
; HYPOTHETICAL: NO 



US-08-491-988-1 



Query Match 100.0%; Score 36; DB 2; Length 7; 

Best Local Similarity 100.0%; Pred. No. 2.5e+05; 

Matches 7; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 PKKKRKV 7 

I I I I I I I 
Db 1 PKKKRKV 7 



RESULT 2 3 
US-08-350-215-22 

; Sequence 22, Application US/08350215 

; Patent No. 6004940 

; GENERAL INFORMATION: 

; APPLICANT: MARASCO, WAYNE A. 

; APPLICANT: RICHARDSON, JENNIFER 

TITLE OF INVENTION: INTRACELLULAR TARGETING OF ENDOGENOUS 
; TITLE OF INVENTION: PROTEINS 
NUMBER OF SEQUENCES: 58 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: DAVID G. CONLIN; DIKE, BRONSTEIN, ROBERTS & 
; ADDRESSEE: CUSHMAN 

STREET: 130 WATER STREET 
CITY: BOSTON 
STATE: MASSACHUSETTS 
; COUNTRY: US 

; ZIP: 02109 

COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
; OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentin Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 
; APPLICATION NUMBER: US/08/350,215 

; FILING DATE: 12-DEC-1994 

CLASSIFICATION: 514 
ATTORNEY/AGENT INFORMATION: 
NAME: EISENSTEIN, RONALD I. 
; REGISTRATION NUMBER: 30628 

REFERENCE/ DOCKET NUMBER: 41956-CP3 
; TELECOMMUNICATION INFORMATION: 
TELEPHONE: (617) 523-3400 
TELEFAX: (617) 523-6440 
; TELEX: 200291 STRE UR 

; INFORMATION FOR SEQ ID NO: 22: 

SEQUENCE CHARACTERISTICS: 
; LENGTH: 7 amino acids 

; TYPE: amino acid 

; TOPOLOGY: linear 

US-08-350-215-22 



Query Match 100.0%; Score 36; DB 3; Length 7; 

Best Local Similarity 100,0%; Pred, No. 2.5e+05; 
Matches 7; Conservative 0; Mismatches 0; Indels 



0; Gaps 



0; 



Qy 1 PKKKRKV 7 

I I I I I I I 
Db 1 PKKKRKV 7 

Search completed: November 12, 2003, 14:26:09 
Job time : 22 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence: 

Scoring table: 



Searched: 



November 12, 2003, 14:25:15 ; Search time 29 Seconds 

(without alignments) 
41.457 Million cell updates/sec 

US-10-031-505-6 
36 

1 PKKKRKV 7 

BLOSUM62 

Gapop 10.0 , Gapext 0.5 

644079 seqs, 171749292 residues 



Total number of hits satisfying chosen parameters: 644079 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : Published_Applications_AA: * 

1: /cgn2_6/ptodata/l/pubpaa/US07_PUBCOMB.pep:* 

2 : /cgn2_6/ptodata/l/pubpaa/PCT_NEW_PUB.pep: * 

3: /cgn2_6/ptodata/l/pubpaa/US06_NEW_PUB.pep: * 

4 : / cgn2_6/ptodata/ 1 /pubpaa/US06_PUBCOMB . pep : * 

5: /cgn2__6/ptodata/l/pubpaa/US07_NEW_PUB.pep: * 

6: /cgn2_6/ptodata/l/pubpaa/PCTUS_PUBCOMB.pep: * 

7 : /cgn2_6/ptodata/l/pubpaa/US08_NEW_PUB.pep:-*' 

8 : /cgn2_6/ptodata/l/pubpaa/US08_PUBCOMB.pep: * 

9: /cgn2_6/ptodata/l/pubpaa/US09A_PUBCOMB.pep: * 

10: /cgn2__6/ptodata/l/pubpaa/US09B_PUBCOMB.pep: * 

11 : /cgn2_6/ptodata/l/pubpaa/US09C_PUBCOMB.pep: * 

12 : /cgn2__6/ptodata/l/pubpaa/US09_NEW_PUB.pep: * 

13: /cgn2_6/ptodata/l/pubpaa/US10A_PUBCOMB.pep:* 

14 : /cgn2_6/ptodata/l/pubpaa/US10B__PUBCOMB.pep: * 

15 : /cgn2_6/ptodata/ l/pubpaa/US10C_PUBCOMB . pep : * 

16: /cgn2__6/ptodata/l/pubpaa/US10_NEW_PUB.pep:* 

17: /cgn2_6/ptodata/l/pubpaa/US60_NEW_PUB.pep:* 

18: /cgn2_6/ptodata/l/pubpaa/US60_PUBCOMB.pep:* 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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Sequence 
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Sequence 
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Sequence 
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Sequence 
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Sequence 


96, Appl 
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Sequence 


102, App 
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Sequence 


12, Appl 
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Sequence 


7, Appli 
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Sequence 


35, Appl 
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Sequence 


56, Appl 
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Sequence 


12, Appl 
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Sequence 


30, Appl 
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4, Appli 
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53, Appl 
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224, App 
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13, Appl 
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Sequence 
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Sequence 
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Sequence 


3, Appli 



ALIGNMENTS 



RESULT 1 

US-08-873-601-3 

; Sequence 3, Application US/08873601 
; Publication No, US20020064798A1 
; GENERAL INFORMATION; 

; APPLICANT: No. US20020064 7 98Alan, Garry P. 



; APPLICANT: Payan, Donald 

; TITLE OF INVENTION: COMBINATORIAL ENZYMATIC COMPLEXES 

; FILE REFERENCE: A-639 15/DJB/RMS 

; CURRENT APPLICATION NUMBER: US/08/873,601 

; CURRENT FILING DATE: 1997-06-12 

; NUMBER OF SEQ ID NOS : 35 

SOFTWARE: PatentlnVer. 2.0 
; SEQ ID NO 3 
; LENGTH: 7 
; TYPE: PRT 
; ORGANISM: SV 4 0 
US-08-873-601-3 

Query Match 100.0%; Score 36; DB 8; Length 7; 

Best Local Similarity 100.0%; Pred. No. 5.8e+05; 
Matches 7; Conservative 0; Mismatches 0; Indels 

Qy 1 PKKKRKV 7 

11)1111 
Db 1 PKKKRKV 7 



RESULT 7 

US-09-825-012-96 

; Sequence 96, Application US/09825012 

; Patent No. US20020122798A1 

; GENERAL INFORMATION: 

; APPLICANT: Young, Robert 

; TITLE OF INVENTION: Compounds for Targeting 

; FILE REFERENCE: 43191-256808 

; CURRENT APPLICATION NUMBER; US/09/825,012 

; CURRENT FILING DATE: 2001-04-03 

; PRIOR APPLICATION NUMBER: US 60/237,159 

; PRIOR FILING DATE: 2000-10-02 

; PRIOR APPLICATION NUMBER: GB 0008049.9 

; PRIOR FILING DATE: 2000-04-03 

; NUMBER OF SEQ ID NOS: 102 

; SOFTWARE: Patentin version 3.1 

; SEQ ID NO 96 

LENGTH: 7 

TYPE: PRT 
; ORGANISM: Artificial Sequence 
; FEATURE : 

OTHER INFORMATION: Nuclear localisation signal 
US-09-825-012-96 

Query Match 100.0%; Score 36; DB 10; Length 7; 

Best Local Similarity 100.0%; Pred. No. 5.8e+05; 
Matches 7; Conservative 0; Mismatches 0; Indels 

Qy 1 PKKKRKV 7 

I I I I I I I 
Db 1 PKKKRKV 7 



RESULT 19 
US-09-911-569-2 

; Sequence 2, Application US/09911569 
; Publication No. US20030069173A1 
GENERAL INFORMATION: 

APPLICANT: HAWLEY-NELSON, PAMELA 
; LAN, JIANQING 

; SHIH, POJEN 

JESSE, JOEL A. 
SCHIFFERLI, KEVIN P. 
GEBEYEHU, GULILAT 
TITLE OF INVENTION: PEPTIDE-ENHANCED TRANSFECTIONS 
NUMBER OF SEQUENCES: 120 
; CORRESPONDENCE ADDRESS: 

; ADDRESSEE: GREENLEE, WINNER & SULLIVAN 

STREET: 5370 MANHATTAN CIRCLE, SUITE 201 
CITY: BOULDER 
; STATE: CO 

COUNTRY: US 
ZIP: 80303 
; COMPUTER READABLE FORM: 

; MEDIUM TYPE: Floppy disk 

; COMPUTER: IBM PC compatible 

; OPERATING SYSTEM: PC-DOS/MS-DOS 

; SOFTWARE: PatentIn Release #1.0, Version #1,30 

; CURRENT APPLICATION DATA: 

; APPLICATION NUMBER: US/09/911,569 

FILING DATE: 23-Jul-2001 
CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 09/039,780 
FILING DATE: 16-MAR-1998 
ATTORNEY/AGENT INFORMATION: 
; NAME: SULLIVAN, SALLY A. 

; REGISTRATION NUMBER: 32,064 

REFERENCE/DOCKET NUMBER: 32-95D 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (303)499-8080 
TELEFAX: (303)4 99-8 089 
; INFORMATION FOR SEQ ID NO: 2: 

SEQUENCE CHARACTERISTICS: 
; LENGTH: 7 amino acids 

; TYPE: amino acid 

STRANDEDNESS: not relevant 
; TOPOLOGY: linear 

MOLECULE TYPE; peptide 
HYPOTHETICAL: NO 
ANTI-SENSE: NO 

SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
US-09-911-569-2 

Query Match 100.0%; Score 36; DB 11; Length 7 

Best Local Similarity 100,0%; Pred, No. 5,8e+05; 
Matches 7; Conservative 0; Mismatches 0; Indels 



Qy 



1 PKKKRKV 7 



Db 



I I I I I I I 
1 PKKKRKV 7 



RESULT 21 
US-09-897-844-35 

Sequence 35, Application US/09897844 
Publication No. US20030087817A1 
GENERAL INFORMATION: 
APPLICANT: Cox III, George No. US20030087817Albert 
APPLICANT: Case, Casey Christopher 
APPLICANT: Eisenberg, Stephen P. 
APPLICANT: Jarvis, Eric Edward 
APPLICANT: Spratt, Sharon Kaye 
APPLICANT: Sangamo Biosciences, Inc. 

TITLE OF INVENTION: Regulation of Endogenous Gene Expression in Cells Using 
TITLE OF INVENTION: Zinc Finger Proteins 
FILE REFERENCE: 0194 96-002200US 
CURRENT APPLICATION NUMBER: US/ 09/ 897 , 844 
CURRENT FILING DATE: 2001-07-02 
PRIOR APPLICATION NUMBER: 09/229,037 
PRIOR FILING DATE: 1999-01-12 
NUMBER OF SEQ ID NOS : 4 0 
SOFTWARE: PatentIn Ver. 2.0 
SEQ ID NO 35 
LENGTH: 7 
TYPE: PRT 
ORGANISM 
FEATURE: 

NAME/KEY: PEPTIDE 
LOCATION: (1) . . (7) 
OTHER INFORMATION: 
OTHER INFORMATION: 
US-09-897-844-35 



Simian virus 40 



SV40 large T antigen nuclear localization sequence 
(NLS) 



Query Match 100.0%; Score 36; DB 11; Length 7; 

Best Local Similarity 100.0%; Pred. No. 5.8e+05; 
Matches 7; Conservative 0; Mismatches 0; Indels 



0; Gaps 



0; 



Qy 

Db 



1 PKKKRKV 7 

I I I I I I I 
1 PKKKRKV 7 



Search completed: November 12, 2003, 14:30:25 
Job time : 29 sees 



